Korpus: hbs-ba_newscrawl_2011_300K

Weitere Korpora

3.7.1 String similarity graph for words

General information for Levenshtein distance for words. The data are considered as graph with words as nodes and edges weighted with Levenshtein similarity. Only the top 1.000.000 words are considered.

Number of nodes in the top-1M Levenshtein graph
Number of nodes
247142
Number of edges in the top-1M Levenshtein graph
Number of edges
1528448
Minimum word length
Word length
6
Edge weights in the top-1M Levenshtein graph
Levenshtein distance Number of edgeses
0 29083
1 384956
2 1114409
2120 msec needed at 2018-03-06 11:49